CDS

Accession Number TCMCG075C20044
gbkey CDS
Protein Id XP_017978671.1
Location complement(join(22333096..22333254,22333874..22334211,22334888..22335421,22336237..22336336,22336479..22336618,22337391..22337559))
Gene LOC18596567
GeneID 18596567
Organism Theobroma cacao
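
The Location value uses the GenBank feature-location syntax: six exon intervals joined on the complement (minus) strand. As a minimal sketch, assuming only this simple complement(join(...)) form and using a hypothetical helper name `parse_location`, the interval lengths can be summed and checked against the 479 aa protein length reported in this record (plus one stop codon):

```python
import re

# Location string copied from this record.
LOCATION = ("complement(join(22333096..22333254,22333874..22334211,"
            "22334888..22335421,22336237..22336336,"
            "22336479..22336618,22337391..22337559))")

def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple GenBank location.

    Hypothetical helper: handles only plain start..end intervals, optionally
    wrapped in join(...) and complement(...). Fuzzy bounds are not supported.
    """
    strand = "-" if location.startswith("complement(") else "+"
    intervals = [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, intervals

strand, exons = parse_location(LOCATION)
# Coordinates are 1-based and inclusive, so each interval spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in exons)
protein_length = cds_length // 3 - 1  # subtract the stop codon

print(strand, len(exons), cds_length, protein_length)  # - 6 1440 479
```

The 1440 nt total corresponds to 480 codons, that is, 479 residues plus the terminal stop codon.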

Protein

Length 479aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018123182.1
Definition PREDICTED: CTD small phosphatase-like protein 2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category K
Description CTD small phosphatase-like protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko01000
ko01009
KEGG_ko ko:K17616
EC -
KEGG_Pathway -
GOs GO:0003674
GO:0003824
GO:0004721
GO:0006464
GO:0006470
GO:0006793
GO:0006796
GO:0006807
GO:0008150
GO:0008152
GO:0009987
GO:0016311
GO:0016787
GO:0016788
GO:0016791
GO:0019538
GO:0036211
GO:0042578
GO:0043170
GO:0043412
GO:0044237
GO:0044238
GO:0044260
GO:0044267
GO:0071704
GO:0140096
GO:1901564

Sequence

CDS:  
ATGCCATCTCTAAGAATGAAGGCCAAATCCAGCATGGGTTCTGTAAGAGAAAAAAATGGTCTCCGTATGTGTCAGAAGTCTAGCATGATTTGCAAAAGACCATGCTCCCATGTCAGGGTTTTCCAGCAAGGAGCTGAATTTAGTACATGTACTCAAAATTCTCATGATGATTCATTAGACTTGGAAGTGGCTTCACAAGTTGTTGCAACCAATGGAGCCAGTTCTCAGCAACTTATTTTGGATGATGACAATTCTGAGCTTCAGAAACAGCATCCAGTTTTTTTTGATTCTACGACTGTGGGAAGAATGGAATCTGCCGAAGCCTGTGCCTCAAACTTAGAGACAATATTCTCTCCTTTTCTGGAGCCAATTGTAATCCACACTGAACCAAATATTGACAATGATGCAGGGTGTAATGATGGCCCTGAAGTGCCAGCATTAGGGGCTGATGAAAGTGATGATAACAAAAGCTCGTTTGGCAGTCAAACATGTAATGTATCAGATTTCTTTATATCTGACATGATAATTGCAAGCATACCCTTTGATGCAAATGCTGTTGATGATAATATCTCTGGAACCAATTCTTTTCCTGATTTCAAGTGTTCTGAGCCAAGTATGTTGTTTGATGTGGCCGAGCAATACATGATACTGCCTTTCCTCGAGGACACTGTCAAAGCAAATGATATAAATGATGTTAATTTCTGTGAAGAAGCCACGATGGCTCAAGATAATGCTGGTTTATATGTAGCAATTGATCAGATGAGATCCTGCATCCCGGAATCTGATGTTAACTCTGACTCGGATCAAGCGGACGACTTCGATCCACAGTCATTTATAAAAAATTTACCAGAACTATCTGATGTTGTATCAAGCTTTCGACCTGCTATGGTGCCAAAGGAGGCTTGGAGAAGGAAGCCCGTAACCCTTGTGCTTGATTTGGATGAAACTCTCGTCCACTCTACACTAGAACATTGTGATAATGCAGACTTCACCTTTACAGTATTTTTCAACATGAAAGAGCACACTGTGTATGTAAAGCAGAGGCCTCACCTGCAGACATTTTTGGAGAAAGTTGCAGAGATGTTTGAAGTTGTCATCTTTACTGCAAGCCAAAGTATTTATGCAGAACAATTACTGGACATATTGGACCCACATCAAAAGCTCATATCTCGGCGAGTGTATCGTGAATCATGCATTTTTTCAGATGGAAGTTACACTAAAGATTTGACAGTTTTAGGTGTTGATCTTGCAAAAGTTGCTATAATTGATAATTCTCCACAGGTTTTCAGGCTGCAAGTGAATAATGGGATTCCTATTAAGAGTTGGTTTGATGATCCATCTGATTGTGCACTAATTTCATTACTTCCCTTCTTAGAGACTCTGGTTGATGCTGATGATGTCCGTCCTATCATTGCCAAGAAATTTGGTAACAAGGAATAA
Protein:  
MPSLRMKAKSSMGSVREKNGLRMCQKSSMICKRPCSHVRVFQQGAEFSTCTQNSHDDSLDLEVASQVVATNGASSQQLILDDDNSELQKQHPVFFDSTTVGRMESAEACASNLETIFSPFLEPIVIHTEPNIDNDAGCNDGPEVPALGADESDDNKSSFGSQTCNVSDFFISDMIIASIPFDANAVDDNISGTNSFPDFKCSEPSMLFDVAEQYMILPFLEDTVKANDINDVNFCEEATMAQDNAGLYVAIDQMRSCIPESDVNSDSDQADDFDPQSFIKNLPELSDVVSSFRPAMVPKEAWRRKPVTLVLDLDETLVHSTLEHCDNADFTFTVFFNMKEHTVYVKQRPHLQTFLEKVAEMFEVVIFTASQSIYAEQLLDILDPHQKLISRRVYRESCIFSDGSYTKDLTVLGVDLAKVAIIDNSPQVFRLQVNNGIPIKSWFDDPSDCALISLLPFLETLVDADDVRPIIAKKFGNKE